Semantic-based Estimation of Term Informativeness
نویسنده
چکیده
The idea that some words carry more semantic content than others, has led to the notion of term specificity, or informativeness. Computational estimation of this quantity is important for various applications such as information retrieval. We propose a new method of computing term specificity, based on modeling the rate of learning of word meaning in Latent Semantic Analysis (LSA). We analyze the performance of this method both qualitatively and quantitatively and demonstrate that it shows excellent performance compared to existing methods on a broad range of tests. We also demonstrate how it can be used to improve existing applications in information retrieval and summarization.
منابع مشابه
Measuring Term Informativeness in Context
Measuring term informativeness is a fundamental NLP task. Existing methods, mostly based on statistical information in corpora, do not actually measure informativeness of a term with regard to its semantic context. This paper proposes a new lightweight feature-free approach to encode term informativeness in context by leveraging web knowledge. Given a term and its context, we model contextaware...
متن کاملIVal – An Alternative WordNet Browser for Evaluating Semantic Informativeness of Concepts
This paper describes theoretical and practical aspects of a procedure for calculating semantic informativeness of concepts on the basis of WordNet. The IVal system is introduced which provides enhanced functionality for accessing the WordNet database including the computation of the concept’s informativeness, the decompositional analysis of terms, and an interface for extending the lexicon.
متن کاملThe Semantics of the Word Istikbar (Arrogance) in the Holy Quran based on Syntagmatic Relations(A Case Study of Semantic Proximity and Semantic Contrast)
The word istikbar (arrogance) is one of the key words in the monotheistic system of the Quran, which has found a special status as a special feature of the opponents and adversaries of the call to the truth. Given the prominent role of this issue in the human life system and its provision of corruption and moral deviations, it is necessary to represent the nature of the elements that make up th...
متن کاملSemantic Textual Similarity in Quality Estimation
Quality Estimation (QE) predicts the quality of machine translation output without the need for a reference translation. This quality can be defined differently based on the task at hand. In an attempt to focus further on the adequacy and informativeness of translations, we integrate features of semantic similarity into QuEst, a framework for QE feature extraction. By using methods previously e...
متن کاملA Comparative and Contrastive Study on the Meaning Extension of Color Terms in Persian and English
We deal with a wide range of colors in our daily life. They are such ubiquitous phenomena that is hard and next to impossible to imagine even a single entity (be it an object, place, living creature, etc) devoid of them. They are like death and tax which nobody can dispense with. This omnipresence of colors around us has also made its way through abstract and less tangible entities via the inte...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009